Abstract

We introduce and investigate notions of persistent homology for p-groups and 
for coclass trees of p-groups. Using computer techniques we show that persistent 
homology provides fairly strong homological invariants for p-groups of order at most 81.
The strength of these invariants, and some elementary theoretical properties, suggest 
that persistent homology may be a useful tool in the study of prime-power groups. 

1 Introduction 

Persistent homology is a tool from applied topology that was introduced for studying
qualitative properties of large empirical data sets [3]. At its simplest, the idea is to
impose some metric on a data set $X$, choose an appropriate sequence of metric
space inclusions $X = X_0 \subset X_1 \subset X_2 \subset \cdots \subset X_N$, and then study the behaviour
of the induced homology maps
$H_n(X_0,\mathbb{F}) \to H_n(X_1,\mathbb{F}) \to H_n(X_2,\mathbb{F}) \to \cdots \to H_n(X_N,\mathbb{F})$
in a given degree $n$. The coefficients $\mathbb{F}$ are typically chosen to be a
field. Such a sequence of linear maps is then determined, up to isomorphism, by
an upper triangular matrix $P_n = (p_{i,j})$ with $p_{i,j}$ the dimension of the image of the
map $H_n(X_i,\mathbb{F}) \to H_n(X_j,\mathbb{F})$. In particular $p_{i,i}$ is the dimension of $H_n(X_i,\mathbb{F})$. The
matrix $P_n$ contains information on the extent to which homology $n$-cycles persist
through lengths of the induced sequence. Cycles that persist for a significant length
are deemed to be significant and, for appropriately chosen $X_i$, are likely to represent
some qualitative feature of the initial data set $X$. Cycles that persist for only a short
length are deemed to be less significant.

Persistent homology analysis can be applied to any data set $X$ for which we have
a suitable topology, and for which we have a meaningful sequence of topological
inclusions. It provides a concise set of numerical descriptors for homological features
of (the sequence of inclusions associated to) $X$. In this paper we investigate the
potential of applying the idea to groups and particularly to finite $p$-groups.
We can view the elements of a group $G$ as being the vertices of a Cayley graph.
Furthermore, we can view the Cayley graph as the 1-skeleton of the universal cover
$EG$ of a classifying CW-space $BG$. We set $X = BG$ and construct an inclusion
$X \hookrightarrow X_1$ from any surjective group homomorphism $\varphi\colon G \twoheadrightarrow Q$ with $X_1 = BQ$ a
classifying space obtained by attaching cells to $BG$. A sequence of inclusions
$X \hookrightarrow X_1 \hookrightarrow \cdots \hookrightarrow X_N$ corresponds to a sequence
$G \twoheadrightarrow Q_1 \twoheadrightarrow Q_2 \twoheadrightarrow \cdots \twoheadrightarrow Q_N$ of surjective
group homomorphisms, or equivalently, to an increasing sequence
$N_1 \le N_2 \le \cdots \le N_N$ of normal subgroups of $G$ (where $N_i$ is the kernel of the
composite surjection $G \twoheadrightarrow Q_i$).

We focus on finite prime-power groups $G$, and on the following five normal series
in $G$.

$L_1(G) = G$,     $L_{i+1}(G) = [L_i(G), G]$     (lower central)
$L^p_1(G) = G$,     $L^p_{i+1}(G) = [L^p_i(G), G]\,(L^p_i(G))^p$     (lower $p$-central)
$D_1(G) = G$,     $D_{i+1}(G) = [D_i(G), D_i(G)]$     (derived)
$Z_0(G) = 1$,     $Z_{i+1}(G)$ = preimage of $Z(G/Z_i(G))$ in $G$     (upper central)
$Z^p_0(G) = 1$,     $Z^p_{i+1}(G)$ = elements of order $\le p$ in the preimage of $Z(G/Z^p_i(G))$ in $G$     (upper $p$-central)
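As a small sanity check on the lower central series, the following sketch computes the orders of the terms $L_i(G)$ for the dihedral group of order 64 in plain Python. The encoding of group elements as pairs `(k, s)` standing for $r^k f^s$, and all function names, are illustrative assumptions and are not taken from the software packages cited in this paper.

```python
from itertools import product

N = 32  # number of rotations; the dihedral group below has order 2 * N = 64

def mul(a, b):
    # (k, s) stands for r^k f^s, using the relation f r f = r^{-1}
    (k1, s1), (k2, s2) = a, b
    return ((k1 + (-k2 if s1 else k2)) % N, s1 ^ s2)

def inv(a):
    k, s = a
    # reflections are involutions; rotations invert by negating the exponent
    return (k, 1) if s else ((-k) % N, 0)

def generated(gens):
    """Subgroup generated by gens (closure under right multiplication)."""
    elems = {(0, 0)}
    frontier = [(0, 0)]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = mul(x, g)
            if y not in elems:
                elems.add(y)
                frontier.append(y)
    return elems

def commutator_subgroup(H, G):
    """Subgroup [H, G] generated by all commutators h^-1 g^-1 h g."""
    comms = {mul(mul(inv(h), inv(g)), mul(h, g)) for h, g in product(H, G)}
    return generated(comms)

def lower_central_orders():
    """Orders of L_1(G), L_2(G), ... down to the trivial subgroup."""
    G = generated([(1, 0), (0, 1)])
    series = [G]
    while len(series[-1]) > 1:
        series.append(commutator_subgroup(series[-1], G))
    return [len(H) for H in series]

print(lower_central_orders())
```

In this encoding the series reaches the trivial subgroup at $L_6(G)$, so the group has nilpotency class 5.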



These five series can be regarded as functors from the category whose objects are
groups and whose arrows are surjections of groups, to the category whose objects
are sequences of group homomorphisms and whose morphisms are commutative
diagrams of groups. So, for instance, we view $Z$ as a functor which sends a
surjection $G \twoheadrightarrow Q$ to the following commutative diagram.

$$\begin{array}{ccccccc}
G & \twoheadrightarrow & G/Z_1(G) & \twoheadrightarrow & G/Z_2(G) & \twoheadrightarrow & \cdots \\
\downarrow & & \downarrow & & \downarrow & & \\
Q & \twoheadrightarrow & Q/Z_1(Q) & \twoheadrightarrow & Q/Z_2(Q) & \twoheadrightarrow & \cdots
\end{array}$$

For $F$ equal to any of $L$, $L^p$, $D$, $Z$, $Z^p$ we define the persistence matrix
$P^F_n(G) = (p_{i,j})$ to be an upper triangular matrix. For $F$ equal to $Z$ or $Z^p$ and
$i \le j$ the entry $p_{i,j}$ is the dimension of the image of the map

$h_{ij}\colon H_n(B(G/F_i(G)),\mathbb{F}_p) \to H_n(B(G/F_j(G)),\mathbb{F}_p)$.

For $F$ equal to $L$, $L^p$ or $D$ and $i \le j$ the entry $p_{i,j}$ is the dimension of the
image of the map

$h_{ij}\colon H_n(B(G/F_j(G)),\mathbb{F}_p) \to H_n(B(G/F_i(G)),\mathbb{F}_p)$.

The family $H^F_n(G) = \{h_{ij}\}_{i \le j}$ is called a persistence module and is a functorial
invariant of the group $G$. Two persistence modules are isomorphic if and only if the
corresponding persistence matrices are identical.
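To make the definition concrete, here is a minimal sketch of how a persistence matrix could be assembled once the consecutive linear maps are known as matrices over $\mathbb{F}_p$. It does not compute group homology. The input format (`dims`, `maps`) and the function names are assumptions made for this illustration only.

```python
def rank_mod_p(rows, p):
    """Rank over F_p of a matrix given as a list of integer rows."""
    rows = [[x % p for x in r] for r in rows]
    rank = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], -1, p)  # modular inverse (Python 3.8+)
        rows[rank] = [(x * inv) % p for x in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                f = rows[r][col]
                rows[r] = [(a - f * b) % p for a, b in zip(rows[r], rows[rank])]
        rank += 1
    return rank

def mat_mul_mod_p(A, B, p):
    """Matrix product A * B with entries reduced modulo p."""
    return [[sum(a * b for a, b in zip(row, col)) % p for col in zip(*B)]
            for row in A]

def persistence_matrix(dims, maps, p):
    """dims[i] is dim V_i; maps[i] is the matrix of V_i -> V_{i+1}
    acting on column vectors, so it has dims[i+1] rows and dims[i] columns.
    Entry (i, j) with i <= j is the rank of the composite V_i -> V_j."""
    n = len(dims)
    P = [[0] * n for _ in range(n)]
    for i in range(n):
        P[i][i] = dims[i]
        composite = None
        for j in range(i + 1, n):
            step = maps[j - 1]
            composite = step if composite is None else mat_mul_mod_p(step, composite, p)
            P[i][j] = rank_mod_p(composite, p)
    return P

# Toy example over F_2: V_0 = V_1 = F_2^2, V_2 = F_2.
example = persistence_matrix([2, 2, 1], [[[1, 0], [0, 0]], [[1, 1]]], 2)
print(example)
```

The diagonal records the dimensions of the spaces, and each off-diagonal entry records how much of one space survives into a later one.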

Our aim is to investigate the extent to which persistence matrices can be used to
determine the structure of finite $p$-groups. For instance, we use computer techniques
[4, 5, 6, 7] to establish that the degree seven upper central series persistence
matrix $P^Z_7(G)$ yields 181 distinct matrices when $G$ ranges over the 267 groups of order
64. Furthermore, the groups of order 64 give rise to 187 distinct infinite sequences
$P^Z_*(G) = (P^Z_n(G))_{n \ge 1}$. We give analogous statistics for each of the five series $F$ and
all prime-power groups of order at most 81. We also give some elementary
theoretical results aimed at understanding the nature of the group-theoretic information
contained in persistence matrices. We believe that the apparent strength of
persistence matrices as group invariants, and their basic theoretical properties, suggest
that persistent homology may be a useful tool in the study of prime-power groups.

Figure 1: Degree 1, 2 and 3 lower central bar codes for $D_{32}$ (panels (a), (b) and (c)).



2 Examples and properties of persistence 



Consider the dihedral group $D_{32}$ of order 64. Using algorithms recently implemented
in the group cohomology package [7] for SAGE (see [6] for an overview of its
algorithms) or the GAP homological algebra package HAP [4] (see [5] for an overview of
its algorithms) one can compute the lower central series persistence matrix of $D_{32}$ in
degree 2 to be

[matrix display lost in extraction; only its leading entry, 3, survives]

The first row of this matrix implies that $H_2(D_{32},\mathbb{F}_2)$ has dimension 3, and that
precisely two basis elements persist (i.e. remain non-zero) under the induced maps
$H_2(D_{32},\mathbb{F}_2) \to H_2(D_{32}/L_i(D_{32}),\mathbb{F}_2)$ $(2 \le i \le 5)$. The second row implies that
$H_2(D_{32}/L_5(D_{32}),\mathbb{F}_2)$ has a basis of three elements, precisely two of which persist
under the induced maps $H_2(D_{32}/L_5(D_{32}),\mathbb{F}_2) \to H_2(D_{32}/L_i(D_{32}),\mathbb{F}_2)$ $(2 \le i \le 4)$. The
matrix is represented by the persistence bar code shown in Figure 1(b) and, in fact,
can be reconstructed from the information in this bar code. Persistence bar codes
for the matrices $P^L_n(D_{32})$, $n = 1, 3$, are given in Figures 1(a) and (c). Persistence
bar codes for the matrices $P^L_n(Q_{32})$, $n = 1, 2, 3$, associated to the quaternion group
of order 64 are given in Figure 2. Persistence bar codes for the matrices $P^L_n(QD_{32})$,
$n = 1, 2, 3$, associated to the quasi-dihedral group of order 64 are given in Figure
3. The use of bar codes for describing persistence matrices was introduced by G.
Carlsson et al. in [2].

We are interested in the extent to which group-theoretic information is retained
by persistence matrices for the functors $L$, $L^p$, $D$, $Z$, $Z^p$. We shall let $F$ denote any
one of the above five functors and let $F_i(G)$ denote the $i$th term in the corresponding
normal series of $G$.

Figure 2: Degree 1, 2 and 3 lower central bar codes for $Q_{32}$ (panels (a), (b) and (c)).

Figure 3: Degree 1, 2 and 3 lower central bar codes for $QD_{32}$ (panels (a), (b) and (c)).

Proposition 1 Let $G$ be a $p$-group.

(i) For $F$ any of the above five functors the first persistence matrix $P^F_1(G)$ determines
the minimal number of generators of $G/F_i(G)$ for all $i \ge 1$.

(ii) For $F = L$ or $Z$ the first persistence matrix $P^F_1(G)$ determines the nilpotency
class of $G$.

(iii) For $F = L^p$ or $Z^p$ the first two persistence matrices $P^F_1(G)$ and $P^F_2(G)$ determine
the order of $G$.

Proof. Assertion (i) follows from the fact that the dimension of the vector space
$H_1(G/F_i(G),\mathbb{F}_p) \cong G/F_i(G)[G,G]G^p$ is equal to the minimal number of generators
of $G/F_i(G)$. This dimension is the entry $p_{i,i}$ in the first persistence matrix. Assertion
(ii) is just the observation that the number of columns in the persistence matrix is
by definition equal to the length of the upper or lower central series of $G$. We prove
assertion (iii) just for the functor $F = Z^p$. We use induction on the length $k$ of
the upper $p$-central series. If $k = 1$ then $G$ is the trivial group. If $k = 2$ then $G$ is
an elementary abelian $p$-group of order $p^d$ where $d$ can be determined by (i). As
an inductive hypothesis suppose that the assertion is true when the upper $p$-central
series has length $k$. For $G$ a group with upper $p$-central series of length $k + 1$ we set
$Q = G/Z^p_1(G)$. The five term natural exact homology sequence

$H_2(G,\mathbb{F}_p) \to H_2(Q,\mathbb{F}_p) \to Z^p_1(G) \to H_1(G,\mathbb{F}_p) \to H_1(Q,\mathbb{F}_p) \to 0$     (1)

allows us to determine the dimension of the vector space $Z^p_1(G)$ from the first two
upper $p$-central persistence matrices. By the inductive hypothesis we can determine
the order of $Q$ from these two matrices. Then we have the order $|G| = |Q|\,|Z^p_1(G)|$ as
required. □

Figure 4: First upper 2-central bar code for $G = C_2 \times C_4 \times C_4 \times C_{16}$

Proposition 2 The abelian invariants of an abelian $p$-group $G$ are determined by
the first upper $p$-central persistence matrix $P^{Z^p}_1(G)$.

Proof. We can work by induction on the length $k$ of the upper $p$-central series. If
$k = 1$ then the persistence matrix has just one entry, namely the dimension of the
elementary abelian group $G$. In general we set $Q = G/Z^p_1(G)$ and, as an inductive
hypothesis, assume the proposition true for $Q$. Any surjection $G \twoheadrightarrow Q$ of abelian
groups induces a surjection in second homology $H_2(G,\mathbb{F}_p) \twoheadrightarrow H_2(Q,\mathbb{F}_p)$. The exact
sequence (1) thus allows us to determine the dimension $d$ of the vector space $Z^p_1(G)$
from $P^{Z^p}_1(G)$. The abelian invariants of $G$ can be obtained from those of $Q$ by
multiplying precisely $d$ of the highest abelian invariants of $Q$ by $p$. As an example,
the bar code for $P^{Z^p}_1(G)$ is given in Figure 4 for the group $G = C_2 \times C_4 \times C_4 \times C_{16}$.
□
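The following sketch illustrates Proposition 2 on the example $G = C_2 \times C_4 \times C_4 \times C_{16}$. It rests on two assumptions that are not spelled out in the text: that for an abelian $p$-group the $i$th upper $p$-central term consists of the elements of order dividing $p^i$, and that rows and columns are indexed by $i = 0, 1, \ldots$ starting from $G$ itself. Under these assumptions the entry $p_{i,j}$ is the number of cyclic factors $C_{p^e}$ with $e > j$, because a surjection of groups induces a surjection on first mod-$p$ homology. The function names are hypothetical.

```python
def first_upper_p_central_matrix(exponents):
    """First persistence matrix for C_{p^e1} x ... x C_{p^ek}.

    Index i stands for the quotient of G by its elements of order
    dividing p^i (assumed to be the i-th upper p-central term).
    """
    top = max(exponents)
    # rank of the j-th quotient: one generator per factor with exponent > j
    rank = [sum(1 for e in exponents if e > j) for j in range(top)]
    return [[rank[j] if j >= i else 0 for j in range(top)] for i in range(top)]

def exponents_from_matrix(matrix):
    """Recover the exponents e_1 <= ... <= e_k from the diagonal alone."""
    diag = [matrix[i][i] for i in range(len(matrix))] + [0]
    exps = []
    for m in range(1, len(diag)):
        # factors of exponent exactly m vanish between quotient m-1 and m
        exps.extend([m] * (diag[m - 1] - diag[m]))
    return sorted(exps)

# C_2 x C_4 x C_4 x C_16 with p = 2 has exponents 1, 2, 2, 4.
matrix = first_upper_p_central_matrix([1, 2, 2, 4])
print(matrix)
print(exponents_from_matrix(matrix))
```

The round trip recovers the exponents 1, 2, 2, 4, that is, the abelian invariants 2, 4, 4, 16.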

We regard a bar code as a directed graph whose vertices are arranged in columns, 
whose edges connect certain pairs of vertices in neighbouring columns, and where all 
edges are horizontal and directed away from the first column. 
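One way to pass from a persistence matrix to the horizontal paths of a bar code is the standard inclusion-exclusion formula for interval multiplicities, sketched below. The number of paths running exactly from column $i$ to column $j$ is $p_{i,j} - p_{i-1,j} - p_{i,j+1} + p_{i-1,j+1}$, with entries outside the matrix read as zero. The zero-based column indexing and the function name are assumptions of this sketch, and the drawing conventions of the figures in this paper may differ.

```python
def bar_code_intervals(P):
    """Return {(first_column, last_column): multiplicity} for an
    upper triangular matrix P of ranks of composite maps."""
    n = len(P)

    def rank(i, j):
        # entries outside the upper triangle count as zero
        if i < 0 or j >= n or i > j:
            return 0
        return P[i][j]

    bars = {}
    for i in range(n):
        for j in range(i, n):
            m = rank(i, j) - rank(i - 1, j) - rank(i, j + 1) + rank(i - 1, j + 1)
            if m:
                bars[(i, j)] = m
    return bars

# Two columns with three vertices each, two of which are joined by edges.
bars = bar_code_intervals([[3, 2], [0, 3]])
print(bars)
```

Summing the multiplicities of all paths that pass through a given column recovers the diagonal entry for that column, which is one way to see that the matrix and the bar code carry the same information.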

Proposition 3 Let $G$ be a finite $p$-group.

(i) The bar code for the first lower central persistence matrix $P^L_1(G)$ consists of
$d = \mathrm{rank}(G/[G,G]G^p)$ horizontal paths, each starting in the first column and ending
in the final column.

(ii) In the bar code for the second lower central persistence matrix $P^L_2(G)$ every
horizontal path starts in the first column.

(iii) In the bar code for the second lower central persistence matrix $P^L_2(G)$ let $v$ be the
number of vertices in the $j$th column $(j \ge 2)$ that are not incident with an edge. Set
$j' = c + 2 - j$ where $c$ is the nilpotency class of $G$. Then $v$ is the dimension of the vector
space $L_{j'}(G)/L_{j'+1}(G) \otimes \mathbb{F}_p$. (In particular, the number of vertices in the right-most
column not incident with an edge is equal to the rank of $L_2(G)/L_3(G) \otimes \mathbb{F}_p$.)
Proof. Assertion (i) follows from the canonical isomorphisms $H_1(G/L_i(G),\mathbb{F}_p) \cong
G/[G,G]G^p$ for $i > 0$. Assertion (ii) follows from the natural exact sequences

$H_2(G/L_{i+1}(G),\mathbb{F}_p) \to H_2(G/L_i(G),\mathbb{F}_p) \to L_i(G)/L_{i+1}(G) \otimes \mathbb{F}_p \to 0$

$H_2(G,\mathbb{F}_p) \to H_2(G/L_i(G),\mathbb{F}_p) \to L_i(G)/L_{i+1}(G) \otimes \mathbb{F}_p \to 0$

that arise as part of the five term exact sequence in mod-$p$ homology. These two
sequences imply that the two homomorphisms $H_2(G/L_{i+1}(G),\mathbb{F}_p) \to H_2(G/L_i(G),\mathbb{F}_p)$
and $H_2(G,\mathbb{F}_p) \to H_2(G/L_i(G),\mathbb{F}_p)$ have identical images, since each image is the
kernel of the same map onto $L_i(G)/L_{i+1}(G) \otimes \mathbb{F}_p$. This identity implies (ii).
Assertion (iii) follows from the second of the exact sequences. □

3 Persistence matrices as group invariants 

Proposition 2 states that $P^{Z^p}_1$ is a complete invariant for abelian $p$-groups. We
now investigate the strength of persistent homology as a group invariant for finite
prime-power groups of low order. The computations were made using the second
author's group cohomology package [7] for the computer algebra system Sage [9].
Where possible, the computations were corroborated using the first author's GAP
package [4]. We begin with the following summary of the computations for
$P^F_*(G) = (P^F_n(G))_{n \ge 1}$.

Theorem 4 For the 366 groups of order at most 81:

(i) $P^Z_*$ partitions the groups into 277 classes with maximum class size equal to 7.

(ii) $P^{Z^p}_*$ partitions the groups into 262 classes with maximum class size equal to 8.

(iii) $P^L_*$ partitions the groups into 227 classes with maximum class size equal to 7.

(iv) $P^{L^p}_*$ partitions the groups into 179 classes with maximum class size equal to 15.

(v) $P^D_*$ partitions the groups into 180 classes with maximum class size equal to 13.

A more detailed description of the computations is given in Tables 1 and 2 which
contain, for prime-powers $k$ = 8, 16, 27, 32, 64, 81, and for each of the five functors $F$:

1. the number Nr($k$) of isomorphism classes of groups of order $k$.

2. the integer pair $(|C|, \max)$ where $|C|$ is the number of classes of groups of order
$k$ distinguished by the invariant $P^F_*$, and max is the maximum size of a class.

3. the smallest integer $t$ for which the invariant $P^F_{\le t} = (P^F_n)_{n \le t}$ is as strong as
$P^F_*$ on the groups of order $k$.

4. an integer triple $(|C|, \max, d)$ where $|C|$ is the number of classes of groups of
order $k$ distinguished by the matrix $P^F_d$, and max is the maximum size of a
class (for some choice of $d$).
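The pair $(|C|, \max)$ in item 2 is a statistic of the partition that an invariant induces on a list of groups. A minimal sketch of that bookkeeping follows. The toy matrices are invented for illustration and are not taken from the computations reported here, and the function names are hypothetical.

```python
from collections import Counter

def freeze(matrix):
    """Turn a list-of-lists matrix into a hashable tuple of tuples."""
    return tuple(tuple(row) for row in matrix)

def partition_stats(invariants):
    """Return (number of classes, size of the largest class) for a
    list of hashable invariants, one invariant per group."""
    sizes = Counter(invariants)
    return len(sizes), max(sizes.values())

# Five hypothetical groups, three of which share the same matrix.
toy = [
    [[2, 1], [0, 2]],
    [[2, 2], [0, 2]],
    [[2, 1], [0, 2]],
    [[3, 1], [0, 2]],
    [[2, 1], [0, 2]],
]
print(partition_stats([freeze(m) for m in toy]))
```

An invariant separates all groups of a given order exactly when the second number equals 1, as in the columns for $k = 8$ and $k = 27$ of Table 1.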

Persistence bar codes can sometimes distinguish between very similar groups.
Consider for example the two groups $G$, $G'$ of order 64 which are given the
identification numbers 158 and 160 in the Small Groups Library [1] that is available in GAP.
Their mod-2 cohomology rings $H^*(G,\mathbb{F}_2)$ and $H^*(G',\mathbb{F}_2)$ have: the same Poincaré
series; the same number of generators and relations sorted by degree; the same depth;
the same a-invariants. Both groups have nilpotency class 3 and their upper central
series admit isomorphisms $Z_n(G) \cong Z_n(G')$, $Z_{n+1}(G)/Z_n(G) \cong Z_{n+1}(G')/Z_n(G')$
for $1 \le n \le 3$. Their $p$-upper central series, lower central series, $p$-lower central series
and derived series admit analogous isomorphisms. However, their upper central bar
codes are different in degree 3 (see Figure 5).

F      invariant        k = 8      k = 16     k = 27     k = 32     k = 64       k = 81
       Nr(k)            5          14         5          51         267          15
Z      (|C|, max)       (5,1)      (13,2)     (5,1)      (44,2)     (187,7)      (14,2)
       t                3          4          3          5          6            5
       (|C|, max, d)    (5,1,3)    (13,2,4)   (5,1,3)    (44,2,7)   (181,7,7)    (14,2,5)
Z^p    (|C|, max)       (5,1)      (13,2)     (5,1)      (42,3)     (174,8)      (14,2)
       t                3          4          3          5          6            5
       (|C|, max, d)    (5,1,3)    (13,2,4)   (5,1,3)    (42,3,5)   (166,8,7)    (14,2,5)

Table 1:

Figure 5: Degree 3 upper central bar codes for groups 158 and 160 of order 64.

F      invariant        k = 8      k = 16     k = 27     k = 32     k = 64        k = 81
       Nr(k)            5          14         5          51         267           15
L      (|C|, max)       (5,1)      (12,2)     (5,1)      (37,3)     (145,7)       (14,2)
       t                3          5          3          5          6             5
       (|C|, max, d)    (5,1,3)    (12,2,4)   (5,1,3)    (37,3,5)   (144,7,9)     (14,2,5)
L^p    (|C|, max)       (4,2)      (9,2)      (5,1)      (28,5)     (110,15)      (14,2)
       t                3          4          3          5          6             5
       (|C|, max, d)    (4,2,3)    (9,2,4)    (5,1,3)    (28,5,5)   (109,15,9)    (14,2,5)
D      (|C|, max)       (5,1)      (10,2)     (5,1)      (29,5)     (108,13)      (14,2)
       t                3          4          3          5          6             5
       (|C|, max, d)    (5,1,3)    (10,2,4)   (5,1,3)    (29,5,7)   (106,13,11)   (14,2,5)

Table 2:

4 Integral persistence

One can use integral homology groups in place of mod $p$ homology groups when
studying persistence. However, the induced homology homomorphisms
$H_n(f)\colon H_n(G,\mathbb{Z}) \to H_n(Q,\mathbb{Z})$ cannot be fully described using the notion of dimension. A partial
description is given by the abelian invariants of the source, the target and the cokernel of
$H_n(f)$. It is partial due to the extension problem. We denote by $IP^F_n(G)$ the upper
triangular matrix whose entry in the $i$-th row and $j$-th column $(j \ge i)$ is a triple
$(A, B, C)$ containing lists of the abelian invariants of the source, target and cokernel
of the map $H_n(G/F_i(G),\mathbb{Z}) \to H_n(G/F_j(G),\mathbb{Z})$. Table 3 indicates the strength of
$IP^{Z^p}_{\le t} = (IP^{Z^p}_n)_{n \le t}$ as an invariant of the prime-power groups of order at most 81 for
$t = 3$.

invariant                          k = 8     k = 16     k = 27    k = 32     k = 64      k = 81
Nr(k)                              5         14         5         51         267         15
$IP^{Z^p}_{\le 3}$: (|C|, max)     (5,1)     (13,2)     (5,1)     (46,3)     (188,8)     (14,2)

Table 3:

Figure 6: Coclass graph $\mathcal{G}(2,1)$

5 Persistence in coclass trees 

Recall that a $p$-group of order $p^n$ and nilpotency class $c$ is said to have coclass
$r = n - c$. For a fixed $p$ and $r$ one can consider the graph $\mathcal{G}(p,r)$ whose vertices
are the $p$-groups of coclass $r$. Two groups $G$ and $H$ are connected by an edge in the
graph if there exists a normal subgroup $N \trianglelefteq H$ of order $p$ such that $H/N \cong G$.

The graph $\mathcal{G}(p,r)$ has infinitely many vertices (since there are infinitely many
groups of coclass $r$) and is a forest of trees (since the above $N$ is the smallest
non-trivial term of the lower central series of $H$). The graph $\mathcal{G}(p,r)$ can be stratified into
levels by deeming all groups of order $p^l$ to be at level $l$. The graph $\mathcal{G}(2,1)$ is shown
in Figure 6. Its three columns contain, respectively, the dihedral groups of order $2^l$,
the quaternion groups of order $2^l$ and the semi-dihedral groups of order $2^l$. Lower
central bar codes for the three coclass 1 groups of order 128 are shown in Figures 1,
2 and 3.

Much is known about the graph $\mathcal{G}(p,r)$. A good general reference is the book
by Leedham-Green and McKay [8]. It is known that the graph is always a forest
containing finitely many trees together with finitely many sporadic groups.
Furthermore, each tree has a unique maximal path of infinite length. In the case of 2-groups
it is known that $\mathcal{G}(p,r)$ has bounded width (i.e. there exists some integer $l$ such
that any vertex is at most $l$ edges away from the infinite maximal path). In the
case of $\mathcal{G}(2,1)$ the infinite path runs through all the dihedral 2-groups, the sporadic
group is the cyclic group of order 4, and the width is $l = 2$.

Given a coclass tree $\mathcal{T}$ in $\mathcal{G}(p,r)$ we denote by $S_\mathcal{T}$ the inverse limit of the infinite
path. It is known that $S_\mathcal{T}$ is a $p$-adic space group. Its translation subgroup is an
abelian normal subgroup $T \trianglelefteq S_\mathcal{T}$. One defines the relative lower central series $T_n$ by
$T_1 = T$ and $T_n = [T_{n-1}, S_\mathcal{T}]$. The quotients $S_\mathcal{T}/T_n$ $(n > 1)$ are precisely the groups
on the infinite path in $\mathcal{T}$.

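We want to define the persistent homology of a coclass tree $\mathcal{T}$. Let $G_l$ denote the
$p$-group at level $l$ on the infinite path of a coclass tree $\mathcal{T}$. Let $\mathrm{Im}(\nu_l^k)$ denote the
image of the canonical homology homomorphism $\nu_l^k\colon H_n(G_{l+k},\mathbb{F}_p) \to H_n(G_l,\mathbb{F}_p)$.

Definition 5 The $l$-persistent homology of $\mathcal{T}$ in degree $n$ is the subgroup

$P_lH_n(\mathcal{T}) = \bigcap_{k=1}^{\infty} \mathrm{Im}(\nu_l^k)$

of the degree $n$ homology group $H_n(G_l,\mathbb{F}_p)$.

Note that there is a canonical infinite sequence of surjective homomorphisms

$\cdots \twoheadrightarrow P_{l+2}H_n(\mathcal{T}) \twoheadrightarrow P_{l+1}H_n(\mathcal{T}) \twoheadrightarrow P_lH_n(\mathcal{T})$.     (2)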

Definition 6 We define the persistent homology $PH_n(\mathcal{T})$ of a coclass tree $\mathcal{T}$ to be
the inverse limit of the sequence (2) of surjections.

The philosophy is that $PH_n(\mathcal{T})$ should capture some group-theoretic properties
that are common to all groups in the tree. Easy results in this direction are parts
(ii) and (iv) of the following proposition. Part (i) of the proposition implies that
the surjections in (2) are isomorphisms for all sufficiently large $l$. Hence $PH_n(\mathcal{T}) \cong
\mathrm{Image}(H_n(G_{l+1},\mathbb{F}_p) \to H_n(G_l,\mathbb{F}_p))$ for all groups $G_l$ on the infinite path in the tree
above a certain level.

Proposition 7 (i) The persistent homology $PH_n(\mathcal{T})$ is a finite dimensional vector
space for all degrees $n \ge 1$.

(ii) The dimension of $PH_1(\mathcal{T})$ equals the minimum number of generators for any
group in the tree.

(iii) $H_2(G,\mathbb{F}_p) \cong PH_2(\mathcal{T}) \oplus \mathbb{F}_p$ for all groups $G$ above a certain level in the tree which
are not leaves. For leaves, the dimension of $H_2(G,\mathbb{F}_p)$ is at least that of $PH_2(\mathcal{T})$.

(iv) Any group $G$ in the tree needs at least $\dim(PH_2(\mathcal{T}))$ relators to define it. If the
group is not a leaf it needs at least $\dim(PH_2(\mathcal{T})) + 1$ relators.

Proof. (i) The $p$-adic space group associated to $\mathcal{T}$ decomposes into a short exact
sequence $1 \to T \to S_\mathcal{T} \to P \to 1$ where $P$ is a finite (point) group. Each group $G_l$ on
the infinite path of $\mathcal{T}$ thus fits into a short exact sequence $1 \to T/T_l \to G_l \to P \to 1$.
Let $R^P_* \to \mathbb{Z}$ be any free $\mathbb{Z}P$-resolution of the integers. Let $R^{T/T_l}_* \to \mathbb{Z}$ be the
minimal free $\mathbb{Z}(T/T_l)$-resolution of the integers constructed as a tensor product of
resolutions of the cyclic summands of $T/T_l$. Note that the number of free generators
of $R^{T/T_l}_*$ in a given degree is independent of $l$. By a Lemma of C.T.C. Wall the
boundary map in the tensor product $R^{T/T_l}_* \otimes R^P_*$ can be perturbed to produce a free
$\mathbb{Z}G_l$-resolution $R^{G_l}_*$. By construction, the number of generators of this latter
resolution, in a given degree, is independent of $l$. This implies that for a given $n$ the
dimension of the homology groups $H_n(G_l,\mathbb{F}_p)$ is bounded by a number depending
only on $P$ and $T$. This means that the sequence of dimensions of the vector spaces
$P_lH_n(\mathcal{T}), P_{l+1}H_n(\mathcal{T}), \ldots$ is bounded above. The sequence is also monotonically
increasing since the maps in (2) are surjective. Hence the sequence of dimensions
converges to the dimension of the inverse limit.

(ii) This follows directly from the definition of $PH_1(\mathcal{T})$ and the isomorphism
$H_1(G,\mathbb{F}_p) \cong G/[G,G]G^p$.

(iii) Let $G_{l+1} \to G_l$ be a homomorphism in the tree from level $l + 1$ to level $l$ with
kernel $K$ of order $p$. The five term natural exact homology sequence

$H_2(G_{l+1},\mathbb{F}_p) \to H_2(G_l,\mathbb{F}_p) \to K \to H_1(G_{l+1},\mathbb{F}_p) \xrightarrow{\cong} H_1(G_l,\mathbb{F}_p) \to 0$     (3)

implies $\mathrm{Image}(H_2(G_{l+1},\mathbb{F}_p) \to H_2(G_l,\mathbb{F}_p)) \oplus K \cong H_2(G_l,\mathbb{F}_p)$. If $G_{l+1}$ happens to
be on the infinite path in the tree then, by (i), the first term in this sum stabilizes
to $PH_2(\mathcal{T})$ for large $l$. If $G_{l+1}$ is not on the infinite path but is not a leaf, then (i)
together with Proposition 3(ii) show that the first term in the sum stabilizes to $PH_2(\mathcal{T})$.
If $G$ happens to be a leaf then we can at least conclude from Proposition 3 that
$H_2(G_{l+1},\mathbb{F}_p)$ maps onto $PH_2(\mathcal{T})$.

(iv) A presentation for $G$ yields the low-dimensional terms in a free $\mathbb{Z}G$-resolution
$R_* \to \mathbb{Z}$ where the $\mathbb{Z}G$-rank of $R_2$ equals the number of relators in the presentation.
Clearly this rank has to be at least the dimension of $H_2(G,\mathbb{F}_p)$. So (iii) gives the
required result. □

The persistent homology could easily be computed for some coclass trees. For 
instance, computer calculations strongly suggest the following result, whose proof 
should be just a routine homological calculation. 

Conjecture 8 For $\mathcal{T}$ the infinite tree in $\mathcal{G}(2,1)$ we have

$PH_n(\mathcal{T}) = \mathbb{F}_2 \oplus \mathbb{F}_2$     $(n \ge 1)$.